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PRUIT-SPECIPIC 
TRANSCRIPTIONAL FACTORS 



CROSS-REFERENCE TO RELATED APPLICATIONS 
This application is a continuation-in-part of 
Application Serial No. 168,190, filed March 15, 1988, 
which is a continuation-in-part of Application Serial 
No. 054,369 filed May 26, 1987, which applications are 
incorporated herein by reference. 

INTRODUCTION 

. Technical Field 

This invention relates to DNft expression cas- 
settes capable of directing fruit-specific expression 
of in vitro constructed expression cassettes in plants 
The invention is exemplified by promoters useful in 
fruit-specific transcription in a tomato plant. 

Background 

Manipulation of plants has proven to be 
significantly more difficult than manipulation of 
prokaryotes and mammalian hosts. As compared to 
prokaryotes and mammalian cells, much less wais known 
about the biochemistry and cell biology of plant cells 
and plants. The ability to transform plant cells and 
regenerate plants is unique to flora since other 
differentiated species provide readily available trans- 
formable germ cells which may be fertilized and intro- 
duced into the live host for fetal development to a 
mature fetus. There has been substantial interest in 
modifying the ovum with inducible transcriptonal initi- 
ation regions to afford inducible transcription and 
expression of the gene introduced into the ovum, rather 
than constituitive expression which would result in ex- 
pression throughout the fetus. 



Also, for plants, it is frequently desirable 
to be able to control expression at a particular stage 
in the growth of the plant or in a particular plant 
part. During the various stages of the growth of the 
plant, and as to the various components of the plant, 
it will frequently be desirable to direct the effect of 
the construct introduced into the entire plant or a 
particular part and/or to a particular stage of differ- 
entiation of the plant cell. For this purpose, regula- 
tory sequences are required which afford the desired 
initiation of transcription in the appropriate cell 
types and/or at the appropriate time in the plant de- 
velopment, without having serious detrimental effects 
on the plant development and productivity. 

It is therefore important to be able to iso- 
late sequences which can be manipulated to provide the 
desired regulation of transcription in a plant cell 
host during the growing cycle of the plant. One aspect 
of this interest is the ability to change the phenotype 
of fruit, so as to provide fruit which will have im- 
proved aspects for storage, handling, cooking, organ- 
oleptic properties, freezing, nutritional value, and 
the like. 

Relevant Literature • 

cDNA clones from tomato displaying differen- 
tial expression during fruit development have been iso- 
lated and characterized (Mansson et al. , Mol. Gen. 
Gg^et. (1985) 200:356-361; Slater et al.. Plant Mol. 
Biol^ (1985) 5:137-147). The studies have focused pri- 
marily on mRNAs which accumulate during fruit ripening. 
One of the proteins encoded by the ripening-specific 
cDNAs has been identified as polygalacturonase (Slater 

al. , Plant Mol. Biol. (1985) 5:137-147). A cDNA 
clone which encodes tomato polygalacturonase has been 
sequenced. Grierson et al. , Nucleic Acids Research 
(1986) 14:8395-8603. The concentration of polygalac- 



turonase mRNA increases 2000-fold between the immature- 
green and red-ripe stages of fruit development. This 
suggests that expression of the enzyme is regulated by 
the specific mRNA concentration which in turn is regu- 
lated by an increase in transcription. Delia Penna et 

Proc. Natl. Acad. Sci. PSA (1986) 82:6420-6424. 
Mature plastid mRNA for psbA (one of the components of 
photosystem II) reaches its highest level late in fruit 
development, whereas after the onset of ripening, plas- 
tid mRNAs for other components of photosystem I and II 
decline to nondetectable levels in chroraoplasts. 
Piechulla et al. , Plant Mol. Biol. (1986) 7:367-376. 

Other studies have focused on cDNAs encoding 
genes under inducible regulation, e.g. proteinase in- 
hibitors which are expressed in response to wounding in- 
tomato (Graham et al., J. Biol. Chem. (1985) 260:6555- 
6560; Graham et al. , J. Biol. Chem. (1985) 260:6561- 
6564) and on mRNAs correlated with ethylene synthesis 
in ripening fruit and leaves after wounding. Smith et 
al., Planta (1986) 168:94-100. 

Leaf disc transformation of cultivated tomato 
is described by McCormick, et al.. Plant Cell Reports 
(1986) 5:81-89. 

SUMMARY OF THE INVENTION 
Novel DNA constructions are provided employing 
a "fruit-specific promoter," particularly those active 
beginning at or shortly after anthesis or beginning at 
the breaker stage,, joined to a DNA sequence of interest 
and a transcriptional termination region. A DNA 
construct may be introduced into a plant cell host for 
integration into the genome and transcription regulated 
at a- time at or subsequent to anthesis. In this 
manner, high levels of RNA and, as appropriate, 
polypeptides, may be achieved during formation and/or 
ripening of fruit. 



BRIEF DESCRIPTION OF TEE DRAWINGS 
Figure 1 shows the nucleotide sequence of the 
cDNA clones pCGN1299 (2A11) and pCGN1298 (3H11). The 
amino acid sequence of the polypeptide encoded by the 
open reading frame is also indicated. 

Figure 2 is a comparison of 2A11 to pea stor- 
age proteins and other abundant storage proteins: 

(a) 2A11 (residues 33-46) is compared 
to PAlb and the reactive site sequences of some prote- 
ase inhibitors, PAlb (residues 6-23), chick pea inhibi- 
tor (residues 11-23), lima bean inhibitor (residues. 
23-35), human al-antitrypsin reactive site peptide. 

The arrow indicates the reactive site. 

(b) is a comparison of the amino ter- 
minal sequence of 2A11 with the amino termini of a 
range of seed proteins. The data have been modified or 
deletions introduced to ma:cimi2e homology; conserved 
residues are shown boxed. The sequences are from the 
following sources: PAlb; barley chloroform/methanol- 
soluble protein d; wheat albumin; wheat o-amylase 
inhibitor 0*28; millet bi-functional inhibitor; castor 
bean 2S small subunit; and napin small subunit. 

Figure 3 is a schematic diagram of the con- 
struction of the binary plasmid pCGN783; (a) through 
(f ) refer to the plasmid constructions in Example 6.1. 

Figure 4 shows the complete sequence of the 
2A11 genomic DNA cloned into pCGN1273 from the Xho l 
site (position 1 at the 5' end) to the EcoRI site 
(position 4654). 

Figure 5 shows the nucleotide sequence of a 
polygalacturonase (PG) genomic clone. 

DESCRIPTION OF TBE SPECIFIC EMBODIMENTS 
In accordance with the subject invention, DNA 
constructs are provided which allow for modification of 
plant phenotype during fruit maturation and ripening. 
The DNA constructs provide for a regulated transcrip- 



tional initiation region associated with fruit develop- 
ment and ripening. Downstream from and under the tran- 
scriptional initiation regulation of the fruit related 
initiation region will be a sequence of interest which 
5 will provide for modification of the phenotype of the 
fruit. Desirably, integration constructs may be pre- 
pared which allow for integration of the transcrip- 
tional cassette into the genome of a plant host. Con- 
veniently, the vector may include a multiple cloning 

10 site downstream from the fruit related transcriptional 
initiation region, so that the integration construct 
may be employed for a variety of sequences in an 
efficient manner. 

Of particular interest is a transcriptional 

15 initiation region which is activated at or shortly 

after anthesis, so that in the early development of the 
fruit, it provides the desired level of transcription 
of the sequence of interest. Normally, the sequence of 
interest will be involved in affecting the process in 

20 the early formation of the fruit or providing a proper- 
ty which is desirable during the growing (expansion) 
period of the fruit, or at or after harvesting. 

The ripening stages of the tomato may be 
broken down into mature green, breaker, turning, pink, 

25 light red and red. Desirably, the transcriptional 
initiation region maintains its activity during the 
expansion and maturation of the green fruit, more de- 
sirably continues active through the ripening or red 
fruit period. Comparable periods for other fruit are 
. 30 referred to as stages of ripening. The invention is 

not limited to those transcriptional initiation regions 
which are activated at or shortly after anthesis but 
also includes transcriptional initiation regions which 
are activated at any of the ripening stages of the 

35 fruit. 



The transcriptional initiation region may be 
native or homologous to the host or foreign or heterol- 
ogous to the host. By foreign is intended that the 
transcriptional initiation region is not found in the 
5^ wild-type host into which the transcriptional initia- 

■■MM 

tion region is introduced. Of particular interest is a 
tomato fruit-specific transcriptional initiation region 
referred to as 2A11 which regulates the expression of a 
2A11 cDNA sequence described in the Experimental sec-. 

10 tion. The 2A11 transcriptional initiation region pro- 
vides for an abundant messenger^ being activated at or 
shortly after anthesis and remaining active until the 
red fruit stage. The expressed protein is a sulfur- 
rich protein similar to other plant storage proteins in 

15 sulfur content and size. Also of interest is the tran- 
scriptional initiation region which regulates expres- 
sion of the enzyme polygalacturonase, an enzyme which 
plays an important role in fruit ripening. The poly- 
galacutonase promoter is active in at least the breaker 

20 through red fruit stage. 

Othfer fruit-specific promoters may be acti- 
vated at times subsequent to anthesis / such as prior to 
or during the green fruit stage, during pre-ripe (e.g./ 
breaker) or even into the red fruit stage. 

25 A transcriptional initiation region may be em- 

ployed for varying the phenotype of the fruit. Various 
changes in phenotype are of interest. These changes 
may include up- or down-regulation of formation of a 
particular saccharide, involving mono- or polysacchar- 

30 ides, involving such enzymes as polygalacturonase, 

levansucrase, dextransucrase, invertase, etc.; enhanced 
lycopene biosynthesis; cytokinin and monellin synthesis. 
Other properties of interest for modification include 
response to stress, organisms, herbicides, bruising, 

35 mechanical agitation, etc., change in growth regula- 
tors, organoleptic properties, etc. For antisense or 
complementary sequence transcription, the sequence will 



usually be at least 12, more usually at least 16 nt. 
Antisense sequences of interest include those of poly- 
galacturonase, sucrase synthase and invertase. 

The transcriptional cassette will include in 
the 5 '-3' direction of transcription, a transcriptional 
and transla.tional initiation region, a sequence of in- 
terest, and a transcriptional and translational termi- 
nation region functional in plants. One or more in- 
trons may be also be present. The DNA sequence may 
have any open reading frame encoding a peptide of 
interest/ e.g. an enzyme, or a sequence complementary 
to a genomic sequence, where the genomic sequence may 
be an open reading frame, an intron, a non-coding 
leader sequence, or any other sequence where the com- 
plementary sequence will inhibit transcription, mes- 
senger RNA processing, e.g. splicing, or translation. 
The DNA sequence of interest may be synthetic, natur- 
ally derived, or combinations thereof. Depending upon 
the nature of the DNA sequence of interest, it may be 
desirable to synthesize the sequence with plant pre- 
ferred codons. The plant preferred codons may be de- 
termined from the codons of highest frequency in the 
proteins expressed in the largest amount in the partic- 
ular plant species of interest. 

In preparing the transcription cassette, the 
various DNA fragments may be manipulated, so as to pro- 
vide for the DNA sequences in the proper orientation 
and, as appropriate, in the proper reading frame. To- 
ward this end, adapters or linkers may be employed for 
joining the DNA fragments or other manipulations may be 
involved to provide for convenient restriction sites, 
removal of superfluous DNA, removal of restriction 
sites, or the like. Toward this end, in vitro muta- 
genesis, primer repair, restriction, annealing, resec- 
tion, ligation, or the like may be employed, where in- 
sertions, deletions or substitutions, e.g. transitions 
and transversions, may be involved. 
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The termination region which is employed will 
be primarily one of convenience, since the termination 
regions appear to be relatively interchangeable. The 
termination region may be native with the transcrip- 
tional initiation region, may be native with the DNA 
sequence of interest/ or may be derived from another 
source. Convenient termination regions are available 
from the Ti-plasmid of A. tumefaciens, such as the oc- 
topine synthase and nopaline synthase termination 
regions. 

By appropriate manipulations, such as restric- 
tion, chewing back or filling in overhangs to provide 
blunt ends, ligation of linkers, or the like, comple- 
mentary ends of the fragments can be provided for join- 
ing and ligation. 

In carrying out the various steps, cloning is 
employed, so as to amplify the amount of DNA and to al- 
low for analyzing the DNA to ensure that the operations 
have occurred in proper manner. A wide variety of 
cloning vectors are available, where the cloning vector 
includes a replication system functional in E. coli and 
a marker which allows for selection of the transformed 
cells. Illustrative vectors include pBR332, pOC ser- 
ies, M13mp series, pACYC184, etc. Thus, the sequence 
may be inserted into *the vector at an appropriate re- 
striction site{s), the resulting plasmid used to trans- 
form the E. coli host, the E. coli grown in an appro- 
priate nutrient medium and the cells harvested and 
lysed and the plasmid recovered. Analysis may involve 
sequence analysis, restriction analysis, electrophore- 
sis, or the like. After each manipulation the DNA se- 
quence to be used in the final construct may be re- 
stricted and joined to the next sequence, where each of 
the partial constucts may be cloned in the same or 
different plasmids. 



In addition to the transcription construct , 
depending upon the manner of introduction of the trans- 
cription construct into the plant, other DNA sequences 
may be required. For example / when using the Ti- or 
Ri-plasmid for transformation of plant cells, as de- 
scribed below, at least the right border and frequently 
both the right and left borders of the T-DNA of the Ti- 
or Ri-plasmids will be joined as flanking regions to 
the transcription construct. The use of T-DNA for 
transformation of plant cells has received extensive 
study and is amply described in EPA Serial No. 120,516, 
Hoekema, In: The Binary Plant Vector System Offset- 
drukkerij Ranters B.V., Alblasserdam, 1985, Chapter V, 
Knauf et al.. Genetic Analysis of Host Range Expression 
t>y Aqrobacterium . In:Molecular Genetics of the Bacteria- 
Plant Interaction, Puhler, A. ed. , Springer-Verlag, NY, 
1983, p. 245, and An et al . , EM30 J. (1985) 4:277-284 

Alternatively, to enhance integration into the 
plant genome, terminal repeats of transposons may be 
used as borders in conjunction with a transposase. In 
this situation, expression of the transposase should be 
inducible, so that once the transcription construct is 
integrated into the genome, it should be relatively 
stably integrated and avoid hopping. 

The transcription construct will normally be 
joined to a marker for selection in plant cells. Con- 
veniently, the marker may be resistance to a biocide, 
particularly an antibiotic, such as kanamycin, G418, 
bleomycin, hygromycin, chloramphenicol, or the like. 
The particular marker employed will be one which will 
allow for selection of transformed cells as compared to 
cells lacking the DNA which has been introduced. 

A variety of techniques are available for the 
introduction of DNA into a plant cell host. These 
techniques include transformation with Ti-DNA employing 
A. tumef aciens or A. rhizoqenes as the transforming 
agent, protoplast fusion, injection, electroporation. 
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etc. For transformation with Agrobacterium , plasmids 
can be prepared in E. coli which plasmids contain DNA 
homologous with the Ti-plasmid, particularly T-DNA. 
The plasmid may or may not be capable of replication in 
Aqrobacterium, that is, it may or may not have a broad 
spectrum prokaryotic replication system, e.g., RK290, 
depending in part upon whether the transcription con- 
struct is to be integrated into the Ti-plasmid or be 
retained on an independent plasmid. By means of a 
helper plasmid, the transcription construct may be 
transferred to the A. tumefaciens and the resulting 
transformed organism used for transforming plant cells. 

Conveniently, explants may be cultivated with 
the A. tumefaciens or A. rhizoqenes to allow for trans- 
fer of the transcription construct to the plant cells, 
the plant cells dispersed in an appropriate selective 
medium for selection, grown to callus, shoots grown and 
plantlets regenerated from the callus by growing in 
rooting medium. The Aqrobacterium host will contain a 
plasmid having the vir genes necessary for transfer of 
the T-DNA to the plant cells and may or may not have 
T-DNA. For injection and electroporation, disarmed 
Ti-plasmids (lacking the tumor genes, particularly the 
T-DNA region) may be introduced into the plant cell. 

As a host cell, any of a number of fruit bear- 
ing plants may be employed in which the plant parts of 
interest are derived from the ovary wall. These in- 
clude true berries such as tomato, grape, blueberry, 
cranberry, currant, and eggplant; stone fruits (drupes) 
such as cherry, plum, apricot, peach, nectarine and 
avocado; compound fruits (druplets) such as raspberry 
and blackberry, in hesperidium (oranges, citrus), the 
expression cassette might be expected to be expressed 
in the "juicy" portion of the fruit. In pepos (such as 
watermelon, cantelope, honeydew, cucumber and squash) 
the equivalent tissue for expression is most likely the 
inner edible portions, whereas in legumes (such as 
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peas, green beans, soybeans) the equivalent tissue is 
the seed pod. 

Identifying useful transcriptional initiation 
regions may be achieved in a number of ways. Where a 
fruit protein has been or is isolated, it may be par- 
tially sequenced, so that a probe may be designed for 
identifying messenger RNA specific for fruit. To fur- 
ther enhance the concentration of the messenger RNA 
specifically associated with fruit, cDNA may be pre- 
pared and the cDNA subtracted with messenger RNA or 
cDNA from non-fruit associated cells. The residual 
cDNA may then be used for probing the genome for com- 
plementary sequences, using an appropriate library 
prepared from plant cells. Sequences which hybridize 
to the cDNA may then be isolated, manipulated, and the 
5 '-untranslated region associated with the coding re- 
gion isolated and used in espression constructs to 
identify the transcriptional activity of the 5 '-un- 
translated region. In some instances, a probe may be 
employed directly for screening a genomic library and 
identifying sequences which hybridize to the probe. 
The sequences will be manipulated as described above to 
identify the 5 ' -untranslated region. 

As an example, a promoter of particular inter- 
est for the subject invention, the fruit-specific tran- 
scriptional initiation region (promoter) from a DNA 
sequence which encodes a protein described as 2A11 in 
the Experimental section was identified as follows. 
cDNA clones made from ripe fruit were screened using 
cDNA probes made from ripe fruit, green fruit, and leaf 
mRNA. Clones were selected having more intense hybrid- 
ization with the fruit DNAs as contrasted with the leaf 
cDNAs. The screening was repeated to identify a par- 
ticular CDNA referred to as 2A11. The 2A11 cDNA was 
then used for screening RNA from root, stem, leaf, and 
seven stages of fruit development after the mRNA was 
sized on gels. The screening demonstrated that the 
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particular message was present throughout the seven 
stages of fruit development. The mRNA complementary to 
the specific cDNA was absent in other tissues which 
were tested. The cDNA was then used for screening a 
genomic library and a fragment selected which hybrid- 
ized to the subject cDNA. The 5' and 3' non-coding re- 
gions were isolated and manipulated for insertion of a 
foreign sequence to be transcribed under the regulation 
of the 2A11 promoter. 

The cells which have been transformed may be 
grown into plants in accordance with conventional ways. 
See, for example, McCormick et al . , Plant Cell Reports 
(1986) ^:81-84. These plants may then be grown, and 
either pollinated with the same transformed strain or 
different strains, identifying the resulting hybrid 
having the desired phenotypic characteristic. Two or 
more generations may be grown to ensure that the sub- 
ject phenotypic characteristic is stably maintained and 
inherited and then seeds harvested for use to provide 
fruits with the new phenotypic property. 

A ptotein is provided having the sequence de- 
-scribed in the Experimental section designated as 2A11. 
This protein could be a storage protein and be useful 
in enhancing sulfur containing amino acids (cysteine 
and methionine) in the diet. It can be obtained in 
substantially pure form by providing for expression in 
prokaryotes or eukaryotes, e.g., yeast by inserting the 
open reading frame into an expression cassette contain- 
ing a transcriptional initiation region. A variety of 
expression cassettes are commercially available or have 
been described in the literature. See, for example, 
U.S. Patent Nos. 4,532,207; 4,546,082; 4,551,433; and 
4,559,302. The product, if intracellular, may be iso- 
lated by lysing of the cells and purification of the 
protein using electrophoresis, affinity chromatography, 
HPLC extraction, or the like. The product may be iso- 
lated in substantially pure form free of other plant 
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products, generally having at least about 95% purity, 
usually at least about 99% purity. 

The following examples are offered by way of 
illustration and not by limitation. 

5 

EXPERIMENTAL 



Example 1 

Construction of Tomato Ripe Fruit cDNA Bank 
10 and Screening for Fruit-Specific Clones 

Tomato plants ( Lycopersicon esculentum cv 
DC82B) were grown under greenhouse conditions. Poly (A)'''RNA 
was isolated as described by Mansson et al. , Mol. Gen. 
Genet. (1985) 200:356-361. The synthesis of cDNA from 
15 poly (A)* RNA prepared from ripe fruit, cloning into the 
PstI site of the plasmid pUC9 and transformation into 
an E. coli vector were all as described in Mansson et 
alw Mol. Gen. Genet. (1985) 200:356-361. 

20 Library Screening 

Two thousand recombinant clones were screened 
by colony hybridization with radiolabeled cDNA made 
from tomato red fruit mRNA, immature 'green fruit mRNA, 
and leaf mRNA. Bacterial colonies immobilized onto 

25 GeneScreen Plus filters (New England Nuclear), were de- 
natured in 1.5 M NaCl in 0.5 M NaOH, then neutralized 
in 1.5 M NaCl in 0.5 M Tris-HCl pH 8, and allowed to 
air dry. Hybridization, washing and autoradiography 
were all performed as described in Maniatis et al. , 

30 Molecular Cloning: A Laboratory Manual (1982) Cold 
Spring Harbor, New York. 

Sixty-five clones were selected which had more 
intense hybridization signals with fruit cDNA than with 
leaf cDNA and therefore appeared to be under-represent- 

35 ed in the leaf mRNA population relative to the fruit 

population. Replicate slot blot filters were prepared 
using purified DNA from the selected clones and hybrid- 



ized with radioactive cDNA from leaf, green fruit, and 
red fruit as before. This allowed selection of cDNA 
clone 2A11, also referred to as pCGN1299 which is on at 
high levels in both the fruit stages (red and green) 
5, and off in the leaf. 

Example 2 
Analysis of Clones 
Synthesis of RNA Probes 

10 The CDNA insert of pCGN1299 was excised as an 

EcoRI to Hindlll fragment of approximately 600 bp (as 
measured on an agarose gel), and subcloned into the 
Riboprobe vector pGEMl (Promega Biotec) , creating 
PCGN488. ^^P-labeled transcripts made from each strand 

15 of the pCGN488 insert using either SP6 or T7 polymerase 
were used as probes in separate Northern blots contain- 
ing mRNA from leaf, immature green and mature red 
fruits. The RNA transcript from 'the SP6 promoter did 
not hybridize to the tomato mRNA. However, the tran- 

20 script from the T7 promoter hybridized to an mRNA of 

approximately 700 nt in length from the green fruit and 
the red fruit but not to mRNA from tomato leaf. The 
direction of transcription of the corresponding mRNA 
was thus determined. 

25 The tissue specificity of the pCGN1299 cDNA 

was demonstrated as follows. RNA from root, stem, 
leaf, and seven stages of fruit development (immature 
green, mature green, breaker, turning, pink, light red, 
and red) was sized on formaldehyde/agarose gels accord- 

30 ing to the method described by Maniatis et al. , (1982), 
immobilized, on nitrocellulose and hybridized to 32p_ 
labeled RNA which was synthesized in vitro from pCGN488 
using T7 polymerase. Each lane contained 100 ng of 
polyA"^ RNA except for two lanes (pink and light red 

35 lanes) which contained 10 ug of total RNA. The 

Northern analysis of mRNA from root, stem, leaf, and 
various stages of fruit development indicated that 
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PCGN1299 cDNA was expressed in all stages of fruit 
development from the early stages immediately after 
anthesis to red ripe fruit. No mHNA hybridizing to 
PCGN1299 was found in leaf, stem, or root tissue. The 
5^ size of the mRNA species hybridizing to the pCGN488 
probe was approximately 700 nt. 

Message abundance corresponding to the 
PCGN1299 cDNA was determined by comparing the hybrid- 
ization intensity of a known amount of RNA synthesized 

10 in vitro from pCGN488 using SP6 polymerase to mRNA from 
red tomato fruit in a Northern blot. The "^^P-labeled 
transcript from pCGN488 synthesized in vitro using T7 
polymerase was used as a probe. The Northern analysis 
was compared to standards which indicated that the 

1,5 pCGN1299 cDNA represents an abundant mRNA class in 
tomato fruit, being approximately 1% of the message. 

Example 3 
Sequencing of PCGN1299 and 
20 PCGN1298 cDNA Clones 

DNA Sequencing 

The polyA+ sequence was missing from pCGN1299 
CDNA. A longer cDNA clone, pCGN1298, therefore was 
identified by its hybridization with the pCGN488 probe. 

25 The complete DNA sequence of the two cDNA inserts was 
determined using both Maxam-Gilbert and the Sanger di- 
deoxy techniques and is as follows. The sequence of 
pCGN1298 contains additional sequences at both the 5' 
and 3' end compared to pCGN1299. As shown in Figure 1, 

30 the sequences are identical over the region that the 
two clones have in common. 

Amino Acid Sequence 

The pCGN1299 cDNA sequence was translated in 
35 three frames. The longest open reading fraune (which 

starts from the first ATG) is indicated. Both pCGN1299 
and PCGN1298 have an open reading frame which encodes a 



96 amino acid polypeptide (see Figure 1). The protein 
has a hydrophobic N-terminus which may indicate a 
leader peptide for protein targeting. A hydrophobicity 
profile was calculated using the Hopp and Woods, (Proc. 
Natl, Acad. Sci. USA (1981) 78:3824-3828) algorithm. 
Residues 10-23 have an extremely hydrophobic region. A 
comparison of 2A11 to pea storage proteins and other 
abundant storage proteins is shown in Figure 2. The 
sulfur-rich composite of the fruit-specific protein is 
similar to a pea storage protein which has recently 
been described (see Higgins et al., J. Biol. Chem. 
(1986) 261:11124-11130, for references to the individ- 
ual peptides). This may indicate a storage role for 
this fruit-specific protein abundant species. 

Example 4 
Screening Genomic Library 
for Genomic Clones 
Southern Hybridization 

Southern analysis was performed as described 
by Maniatis et al., 1982. Total tomato DNA from cult- 
ivar UC82B was digested with EcoRI or Hindlll, sepa- 
rated by agarose gel electrophoresis and transferred to 
nitrocellulose. Southern hybridization was performed 
using a ^^P-labeled probe produced by niclc translation 
of PCGN488 (Maniatis et al. , 1982). The simple 
hybridization pattern indicated that the gene encoding 
pCGN1299 cDNA was present in a few or perhaps even one 
copy in a tomato genome. 

Isolation of a Genomic Clone 

A genomic library established in Charon35/ 
Sau3A constructed from DNA of the tomato cultivar VFNT- 
Cherry was screened using the [-^^PJ-RNA from cDNA clone 
pCGN488 as a probe. A genomic clone containing approx- 
imately 12.5 kb of sequence from the tomato genome was 
isolated. The region which hybridizes to a pCGN48a 
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probe spans an 2CbaI restriction site which was found in 
the cDNA sequence and includes the transcriptional 
initiation region designated 2A11. 

Sequence of Genomic Clone 

The DNA sequence of the genomic clone was 
determined by Sanger dideoxy techniques and is as shown 
in Figure 4. The sequence of the genomic clone is 
identical to the pCGN1299 cDNA clone over the region 
they have in common. 

Subcloninq 

The region surrounding the Xba l restriction 
site, approximately 2.4 Icb in the 5* direction and ap- 
proximately 2.1 kb in the 3' direction was subcloned to 
provide an expression cassette. The 5' Xhol to Xba l 
fragment and the 3' Xba l to EcoRI fragment from the 
2A11 genomic clone were inserted into a pOC-derived 
chloromphenicol plasmid containing a unique Xho l site 
and no Xba l site. This promoter cassette plasmid is 
called PCGN1273. 

Example 5 
Construction of Fruit- 
Specific Antisense Cassette 
Insertion of Antisense Fragment 

The 2A11 genomic fragment was tagged with PC 
antisense sequences by insertion of PG into the unique 
Xbal site of the pCGN1273 promoter cassette in the anti- 
sense orientation. The inserted sequences increased 
the size of the mRNA over the endogenous transcript, 
and thus the expression pattern of the construct could 
be compared to the endogenous gene by a single Northern 
hybridization in a manner analogous to the detection of 
a tuber-specific potato gene described by Eckes et al. , 
Mol. Gen. Genet. 1986 205:14-22. 
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Example 6 

Insertion of Tagged Genomic Construction 
Into Agrobacterium Binary Vectors 
The tagged genomic construction is excised 
5^ using the flanking Xho l restriction enzyme sites and is 
cloned into, .ifhe unique Sai l site of the binary plasmid 
pCGN7a3 containing a plant kanamycin resistance marker 
between the left and right borders to provide plasmid 
PCGN1269. 

10 This plasmid binary vector in E. coli C2110 is 

conjugated into A. tumefaciens containing a disarmed 
Ti-plasmid capable of transferring the polygalacturo- 
nase antisense cassette and the kanamycin resistance 
cassette into the plant host genome, 

15 The Agrobacterium system which is employed is /" 

A. tumefaciens PC2760 (G. Ooms et al., Plasmid (1982) 
7:15-29; Hoekema et al . , Nature (1983) 303:179-181; 
European Patent Application 84-200239-6, 2424183)* 

20 1. Construction of pCGN783 

pCGN7a3 is a binary plasmid containing the 
" left and right T-DNA borders of A. tumefaciens octopine 
Ti-plasmid pTiA6 (Currier and Nester, J, Bacterid. 
(1976) 126 :157-165) the gentamicin resistance gene of 

25 pPHlJl (Eirsch et al*., Plasmid (1984) 12:139-141), the 

35S promoter of cauliflower mosaic virus (CaMV) ( 
(Gardner et al. , Nucleic Acid Res. (1981) 9:1871-1880); 
the kanamycin resistance gene of TnS (Jorgensen, Mol. 
Gen. (1979) 177:65); and the 3* region from transcript 

30 7 of pTiA6 (Currier and Nester, supra (1976)). A sche- 
matic diagram of the construction of pCGN783 is shown 
in Figure 3. (a) through (f) refer to the plasmid 
constructions detailed below. 



35 
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(a) Construction of PCGN587 

The Hindlll-Smal fragment of Tn5 containing 
the entire structural gene for APH3'II (Jorgensen et 
al,, Mol. Gen, (1979) r77:65)/ was cloned into pUC8 
5 (Vieira and Messing, Gene (1982) 19:259)/ converting 

the fragment into a Hindlll-EcoRI fragment , since there 
is an EcoRI site immediately adjacent to the Sma l site. 
The Pstl-EcoRI fragment containing the 3* portion of 
the APH3'II gene was then combined with an EcoRI-BamHI- 

10 Sall-PstI linker into the Eco RI site of pDC7 (pCGN546W) . 
Since this construct does not confer kanamycin resis- 
tance, kanamycin resistance was obtained by inserting 
the Bql l- Pst I fragment of the APH3'II gene into the 
BamHI-PstI site (pCGN546X) . This procedure reassembles 

15 the APE3*II gene, so that Eco RI sites flank the gene. 

An ATG codon was upstream from and out of reading frame 
with the ATG initiation codon of APHS'II. The undesi- 
red ATG was avoided by -inserting a Sau3A-PstI fragment 
from the 5' end of APH3'II, which fragment lacks the 

20 superfluous ATG, into the BamH I- Pst I site of pCGN546W 
to provide plasmid pCGNSSO, The Eco RI fragment of 
pCGNSSO containing...the APH3'II gene was then cloned 
into the EcoR I site of pDC8-pOC13 (K. Buckley supra 
(1985) ) to give pCGNSSl. 

25 ■ Each of the Eco RI fragments containing the 

APH3'II gene was then cloned into the unique . EcoRI site 
of pCGN451, which contains an octopine synthase cas- 
sette for expression to provide pCGN548 (2ATG)) and 
PCGN552 (lATG) . . The. plasmid pCGN451 having the ocs 5' 

30 and the ocs 3* in the proper orientation was digested 
with Eco RI and the EcoRI fragment from pCGN551 con- 
taining the intact kanamycin resistance gene inserted 
with EcoR I site to provide pCGN552 having the kanamycin 
resistance gene in the proper orientation. This ocs/KAN 

35 gene was used to provide a selectable marker for the 
trans type binary vector pCGN587. 



The 5' portion of the engineered octopine syn- 
thase promoter cassette consists of pTiA6 DNA from the 
Xhol at bp 15208-13644 (Barker et al. , supra (1983)), 
which also contains the T-DNA boundary sequence (border) 
implicated in T-DNA transfer. In the plasmid pCGN587r 
the osc/KAN gene from pCGN552 provides a selectable 
marker as well as the right border. The left boundary 
region was first cloned in M13mp9 as a Hindlll-Smal 
piece (PCGN502) (base pairs 602-2212) and recloned as a 
Kpn I-ScoRI fragment in pCGN565 to provide pCGNSSO. 
PCGN565 is a cloning vector based on pDC8-Cm, but con- 
taining pDC18 linkers. pCGNSSO was linearized with 
BamHI and used to replace the smaller Bql l fragment of 
pVCK102 (Knauf and Nester, Plasmid (1982) 8:45), creat- 
ing pCGNSas. By replacing the smaller Sai l fragment of 
PCGN585 with the Xho l fragment from pCGN552 containing 
the ocs/KAN gene, pCGN587 was obtained. 

(b) Construction of PCGN739 (Binary Vector) 

To obtain the gentamicin resistance marker/ 
the resistance gene was isolated from a 3.1 kb ScoRI- 
Pstl fragment of pPHIJI (Hirsch et al., Plasmid (1984) 
12:139-141) and cloned into pDC9 (Vieira et al.. Gene 
(1982) 19:259-268) yielding pCGN549. 

The pCGN549 Hindlll-BamHI fragment containing 
the gentamicin resistance gene replaced the Hindlll- 
Bglll fragment of pCGN587 (for construction, see 
6.1(a), supra ) creating pCGN594. 

The PCGN594 Hin di I I - BamH I region which con- 
tains an ocs-kanamycin-ocs fragment was replaced with 
the Hin dlll- BamH I polylinker region from pDC18 (Yanisch- 
Perron, Gene (1985) 33:103-119) to make pCGN739. 

(c) Construction of 726c (1 ATG-Kanamycin-3 ' region) 
pCGN566 contains the EcoRI-Hindlll linker of 

pUClS (Yanisch-Perron, ibid) inserted into the EcoRI- 
Hindlll sites of pUC13-Cm (K. Buckley, Ph.D. Thesis, 
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University of California, San Diego/ 1985). The Hindlll- 
B2III fragment of pNW31c-8, 29-1 (Thomashow et al.. Cell 
(1980) 19:729) containing ORFl and 2 (Barker et al. , 
Plant Mol . Biol. (1984) 2:335-350) was subcloned into 
the Eindlll-BamHI sites of pCGN566 producing pCGN703. 

The Sau3A fragment of pCGN703 containing the 
3' region of transcript 7 from pTiA6 (corresponding to ' 
bases 2396-2920 of pTil5955 (Barker et al . , supra (1984)) 
was subcloned into the BamHI site of pDClS (Yanisch- 
Perron et al., supra (1985)) producing pCGN709. 

The EcoRI-Smal polylinker region of pCGN709 
was replaced with the EcoRI-Smal fragment from pCGN587 
(see 6.1(a), supra ) which contains the kanamycin resis- 
tance gene (APH3'II) producing pCGN726. 

The EcoRI-Sall fragment of pCGN726 plus the 
Bglll-Sall sites of pDC8-pUC13-cm ( chloramphenical 
resistant, K. Buckley, Ph.D. Thesis, University of 
California, San Diego, 1985) producing pCGN738. To 
construct pCGN734, the Eindlll-Sphl site of M13mpl9 
(Norrander et al. , Gene (1983) 26:101-106). Using an 
oligonucleotide corresponding to bases 3287 to 3300, 
DNA synthesis was primed from this template. Following 
SI nuclease treatment and Hindlll digestion, the resul- 
ting fragment was cloned into the Hindlll-Smal site of 
pUC19 (Yanisch-Perron et al., suora (1985)), The re- 
sulting EcoRI to Hindlll fragment of pTiA6 (correspon- 
ding to bases 3390-4494) into the EcoRI site of paC8 
(Vieira and Messing, suora (1982)) resulting in pCGN734. 
pCGN726c is derived from pCGN738 by deleting the 900 bp 
EcoRI -EcoRI fragment. 

(d) Construction of PCGN167 

PCGN167 is a construct containing a full length 
CaMV promoter, 1 ATG-kanamycin gene, 3' end and the 
bacterial. Tn903-type kanamycin gene. MI is an EcoRI 
fragment from pCGN550 (see construction of pCGN587) and 
was cloned into the Eco RI cloning site in the 1 ATG- 
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Jcanamycin gene proximal to the polylinker region of 
M13nip9. See copending Application Serial No. 920,579, 
filed October 17, 1986, which disclosure is incorpora- 
ted herein by reference. 

To construct pCGN167, the Alul fragment of 
CaMV (bp 7144-7735) (Gardner et al. , Nucl. Acids Res. 

(1981) 9:2871-2888) was obtained by digestion with Alul 
and cloned into the Hindi site of M13mp7 (Vieira, Gene 

(1982) 19:259) to create C614. An EcoRI digest of C614 
produced the EcoRI fragment from C614 containing the 
35S promoter which was cloned into the EcoRI site of 
pUC8 (Vieira et al,. Gene (1982) 19:259) to produce 
pCGN146. To trim the promoter region, the Bglll site 
(bp 7670) was treated with Bqlll and Bal31 and subse- 
quently a B2III linker was attached to the Bal31 trea- 
ted DNA to produce pCGN147. 

pCGN148a containing the promoter region, 
selectable marker (KAN with 2 ATGs) and 3' region was 
prepared by digesting pCGN528 (see below) with Bglll 
and inserting the BamEI-Bqlll promoter fragment from 
PCGN147. This fragment was cloned into the Bqlll site 
•of PCGN528 so that the Bqlll site was proximal to the 
kanamycin gene of pCGN528. 

The shuttle vector used for this construct, 
PCGN528, was made as follows. pCGN525 was made by 
digesting a plasmid containing Tn5. which harbors a 
kanamycin gene (Jorgenson et al., Mol. Gen. (1979) 
177:65) with Hindlll-BamHI and inserting the Hindlll- 
BamHI fragment containing the kanamycin gene into the 
Hindlll-BamEI sites in the tetracycline gene of 
PACYC184 (Chang and Cohen, J. Bacterid. (1978) 134: 
1141-1156). pCGN526 was made by inserting the BamHI 
fragment 19 of pTiA6 (Thomashow et al . , Cell (1980) 
19:729-739) into the BamHI site of pCGN525. pCGN528 
was obtained by deleting the small Xhol fragment from 
PCGN526 by digesting with Xho l and religating. 
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pCGN149a was made by cloning the Bam HI kana- 
mycin gene fragment from pMB9KanXXI into the BamHI site 
of pCGN148a. pMB9KanXXI is a pDC4K variant (Vieira and 
Messing, Gene (1982) 19:259-268) which has the Xhol site 
missing but contains a functional kanamycin gene from 
Tn903 to allow for efficient selection in Aqrobacterium . 

pCGN149a was digested with Bglll and Sphl. 
This small Balll-Sohl fragment of pCGN149a was replaced 
with the BamHI -Sghl fragment from MI (see below) iso- 
lated by digestion with BamHI and Sohl. This produces 
PCGN167. 

(e) - Construction of pCGN766c (35S promoter-3' region) 

The Eindlll-BamHI fragment of pCGN167 contain- 
ing the CaMV-35S promoter, 1 ATG-kanamycin gene and the 
BamHI fragment 19 of pTiA6 was cloned into the BamHI- 
Hindlll sites of pOC19 (Norrander et al. , supra (1985); 
Yanisch-Perron et al., suora (1985)) creating pCGN976. 

The 35S promoter and 3' region from transcript 
7 was developed by inserting a 0.7 kb Hindlll-EcoRI 
fragme.nt of pCGN976 (35S promoter) and the 0.5 kb 
EcoRI-Sall fragment of pCGN709 (transcript 7:3' for 
construction see suora ) into the Hindlll-Sall sites of 
pCGN566 creating pCGN766c. 

(f) Final Construction of PCGN783 

The 0.7 kb Hindlll-EcoRI fragment of pC6N766c 
(CaMV-35S promoter) was ligated to the 1.5 kb EcoRI- 
Sall fragment of pCGN726c (l-ATG-KAN-3 ' region) into 
the Hindlll-Sall sites of pDC119 (J. Vieira, Rutgers 
University, New Jersey) to produce pCGN778. The 2.2 kb 
region of pCGN778, Hindlll-Sall fragment containing- the 
CaMV 35S promoter (l-ATG-KAN-3' region) replaced the 
Hindlll-Sall polylinker region of pCGN739 to produce 
PCGN783. 



Example 7 
Transfer of Genomic Construction 
to Tomato via Cocultivation 

Substantially sterile tomato cotyledon tissue 
is obtained from seedlings which have been grown at 24«C/ 
with a 16hr./ffhr day/night cycle in 100x25 iran petri dishes 
containing Murashige-Skoog salt medium and 0.8% agar 
(pH 6.0). Any tomato species may be used, however, 
here the inbred breeding line was DC82B, available from 
the Department of Vegetable Crops, University of Cali- 
fornia, Davis, CA 9 5616. The cotyledons are cut into 
three sections and the middle placed onto feeder plates 
for a 24-hour preincubation. The feeder plates are pre- 
pared by pipetting 0.5 ml of a tobacco suspension cul- 
.=ture (io^ cells/ml) onto 0.8% agar medium, containing 
Murashige minimal organic medium (K.C. Biologicals) , 
2,4-D (0.1 mg/1), kinetin (1 mg/1), thiamine (0.9 mg/1) 
and potassium acid phosphate (200 mg/1, pH 5.5). The 
feeder plates are prepared two days prior to use. A 
sterile 3 mm filter paper disk containing feeder medium 
is placed on. top of the tobacco cells after the suspen- 
sion cells are grown for two days."" 

Following the preincubation period, the middle 
one third of the cotyledon sections are placed into a 
liquid MG/L broth culture (1-5 ml) of the A. tumefaci- 
ens strain. The binary plasmid pCGN1269 is transferred 
to A. tumefaciens strain 2760 by conjugation or by 
transformation selecting for Gentamicin resistance en- 
coded by the plasmid pCGN1269. The cotyledon sections 
are cocultivated with the bacteria for 48 hrs on the 
feeder plates and then transferred to regeneration 
medium containing 500 mg/1 carbenicillin and 100 mg/1 
kanamycin. The regeneration medium is a K.C. Biologi- 
cals Murashige-Skoog salts medium with zeatin (2 mg/1) 
myo-inositol (100 mg/1), sucrose (20 g/1), Nitsch vita- 
mins and containing 0.8% agar (pH 6.0). In 2-3 weeks, 
shoots are observed to develop. When the shoots are 
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approximately 1.25 cm, they are excised and transferred 
to a Murashige and Skoog medium containing carbenicil- 
lin (500 mg/1) and kanamycin (50 mg/1) for rooting. 
Roots develop within 10-12 days. 

Shoots which develop and subsequently root on 
media contai'ning the kanamycin are tested for APH3'II 
enzyme . 

An aminoglycoside phosphotransferase enzyme 
(APH3'II) assay is conducted on putative transformed 
tomato plants and shoots. APH3'II confers resistance 
to kanamycin and neomycin. APH3'II activity is assayed 
(Reiss et al.. Gene (1984) 30:211-218) employing elec- 
trophoretic separation of the enzyme from other inter- 
fering proteins and detection of its enzymatic activity 
by in situ phosphorylation of kanamycin. Both kanamy- 
cin and [t-^^P] ATP act as substrates and are embedded 
in an agarose gel which is placed on top of the poly- 
acrylamide gel containing the proteins. After the 
enzymatic reaction, the phosphorylated kanamycin is 
transferred to P-81 phosphocellulose ion exchange paper 
and the radiolabeled kanamycin is finally visualized by 
autoradiography. The Reiss et al. method is modified 
in the final washing of the P-81 ion exchange paper by 
rinsing in 0.1 mg/ml of proteinase K. 

Example 8 

Construction of Tagged 2A11 Plasmids 
In Binary Vectors 

The complete sequence of the 2A11 genomic DNA 
cloned into pCGN1273 from the Xho l site (position 1 at 
the 5' end) to the EcoRI site (position 4654) is shown 
in Figure 4. 

pCGN1267 was constructed by deleting from 
pCGN1273 a portion of the plasmid polylinker from the 
EcoRV site to the BamHI site. Two DNA sequences were 
inserted into pCGN1273 at the unique Xbal site (posi- 
tion 2494). This site is in the 3* non-coding region 
of the 2A11 genomic clone before the poly A site. 
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pCGN1273 was tagged with 360 bp (from base 
number 1 to 360) from the 5' region of the tomato poly- 
galacturonase (PG) cDNA clone, Fl (Sheehy et al . , Mol, 
Gen, Genet. (1987) 208 ;30"36) at the unique Xba l re- 
striction enzyme site. The tag was inserted in the 
antisense orientation resulting in plasmid pCGN1271 and 
in the sense orientation yielding plasmid pCGN1270. 
Each plasmid was linearized at the unique Bql ll re- 
striction enzyme site and cloned into the binary vector 
pCGN783 at the unique BamEI restriction enzyme site. 

pCGN1273 was also tagged with a 0.5 kb frag- 
ment of DNA (base number 1626 to 2115) from a PG geno- 
mic clone (see Figure 5) which spans the 5' end of the 
intron/exon junction. This fragment was cloned into 
the Xba l site resulting in plasmid pCGN1215. pCGN1215 
was linearized at the unique Belli site and cloned into 
pCGN783 at the BamH I site resulting in two plasmidsr 
PCGN1219 and pCGN1220, which differ only in the orien- ' 
tation of pCGN1215 within pCGN7a3. 

Three DNA sequences were inserted into pCGN1267 
at the uniqufe Cla l sites (position 2402, 2406)* These 
sites are in the 3' non-coding region of the 2A11 geno- 
mic clone, 21 bp from the stop codon. The 383 bp Xba l 
fragment from the PG cDNA clone was cloned into the 
Cla l site of pCGN1267- after filling in the Xba l and 
Cla l ends with Klenow and blunt ligation. The fragment 
in a sense orientation resulted in plasmid pCGN1263 and 
in the antisense orientation gave pCGN1262. pCGN1263 
was linearized at the unique Bgl ll site and cloned into 
PCGN783 at the BamEI site yielding pCGN1260. pCGN1262 
was also linearized at the Bgl ll site and cloned into 
pCGN783 at the BamH I site resulting in two plasmids, 
pCGN1255 and pCGN1258, which differ only in the orien- 
tation of pCGN1262 in the binary vector pCGN783. 

The 0.5 kb fragment of the PG genomic clone 
spanning the intron/exon junction ( supra ) was cloned 
into pCGN1267 at the Clal site in an antisense direc- 



tion yielding plasmid. pCGN1225 . This plasmid was • 
linearized at the Bglli restriction enzyme site and 
cloned into pCGN783 at the BamEI site producing two 
plasmids, pCGN1227 and pCGN122a, which differ only in 
the orientation of pCGN1225 in the binary vector. 

The Eco 7 fragment (base numbers 5545 to 
12,823) (Barker et al. , Plant Mol. Biol. (1983) 2:335- 
350) from the octopine plasmid pTiA6 of A. tumefaciens 
(Knauf and Nester, Plasmid (1982) 8:45-54) was sub- ' 
cloned into pDC19 at the EcoRI site resulting in plas- 
mid pCGNTl. A Rsa l digest allowed a fragment of DNA 
from bases 8487 to 9036 of the Eco 7 fragment to be sub- 
cloned into the vector ml3 BlueScript Minus (Strata- 
gene, Inc.) at the Sma l site resulting in plasmid 
PCGN1278. This fragment contains the coding region 
of the genetic locus designated tmr which encodes a 
dimethylallyl transferase (isopentenyl transferase) 
(Akiyoshi et al., Proc. Natl. Acad. Sci. USA (1984) 
81:5994-5998; Barry et al. , ibid (1984) 81:4776-4780). 
An exonuclease/mung bean treatment (Promega Biotech) 
produced a deletion on the 5 ' end of the tmr gene to a 
point 39 base pairs 5' of the start codon. The tmr 
gene from pCGK1272 was subcloned into the Clal site of 
pCGN1267. The tmr gene in the sense orientation 
yielded pCGN1261 and in the antisense orientation gave 
plasmid pCGlll266. pCGN1261 was linearized at the Bqlll 
site and cloned into pCGN783 at the BamHI site result- 
ing in plasmid pCGN1254. pCGN1266 was also linearized 
at the Bqlll site and subcloned into pCGN783 at the 
BamHI site yielding two plasmids, pCGN1264 and pCGN1265, 
which differ only in the orientation of pCGN1266 in 
PCGN783, 

Analysis of Expression in Transgenic Plants 

Immature green fruit (approximately 3.2 cm in 
length) was harvested from twO' tomato plants cv. DC82B 
that had been transformed with a disarmed Agrobacterium 
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-Strain containing pCGN1264. Transgenic plants are 
designated 1264-1 and 1264-11.- The pericarp from two 
fruits of each plant was ground to a powder under liq- 
uid Nj/ total RNA extracted and polyA"*" mRNA isolated 
(as described in Mansson et al., Mol. Gen. Genet. 
(1985) 200:356-361). Young green leaves were also 
harvested from each plant and polyA* mRNA isolated. 

Approximately 19 yg of total RNA from fruit, 
70 ng of polyA* mRNA from fruit and 70 ng of polyA"^ 
xnRNA from leaves from transformed plants 1264-1 and 
1264-11 was run on a 0.7% agarose formaldehyde Northern 
gel and blotted onto nitrocellulose (Maniatis et al.. 
Molecular Cloning: A Laboratory Manual (1982) Cold 
Spring Harbor, New York). Also included on the gel as 
a negative control was approximately 50 ng of polyA* 
mRNA from leaf and immature green fruit of a nontrans- 
formed UC82B plant. 

As a positive control and to help in guanti- 
tating mRNA levels, in vitro transcribed RNA from 
pCGN1272 was synthesized using T3 polymerase (Strata- 
gene, Inc.).,- Nineteen pg and 1.9 pg of this in vitro 
synthesized RNA were loaded on the Northern gel. 

The probe for the Northern filter was the 
1.0 kb tmr insert DNA (a Kpn l to Sad fragment) from 
PCGN1272 isolated by elect roelut ion from an agarose gel 
(Maniatis, supra (1982)) and labeled by nick transla- 
tion (Bethesda Research Laboratory kit) using a^^p ^CTP 
(Amersham) . 

The Northern filter was prehybridized at 42«»C 
for 5 hrs in the following solution: 25 ml formamide, 
12.5 ml 20X SSC, 2.5 ml 1 M NaP, 5 ml 50X Denhardts, 
0.5 ml 10% SDS, 1 ml 250 mM EDTA, 1 ml 10 mg/ml ssDNA 
and 2 ml HjO. Then one-fifth volume of 50% dextran 
sulfate and approximately 2.2X lo'' cpm of the probe was 
added and hybridization was for 15 hrs at 42''C. 



The Northern filter was washed one time in 2X 
SSC and 0.1% 'SDS at 55«C for 20 minutes each wash. The 
filter was allowed to air dry before being placed with 
Kodak XAR film and an intensifying screen at -70* for 
5 two days . 

Northern Results on Transgenic Plants 

The nicked tmr probe hybridized with a mRNA 
species approximately 1.7 kb in length was observed in 

10 the total RNA and polyA"^ mRNA fruit lanes of the 

Northern blot. This is the expected length of the 
reintroduced 2A11 gene (0.7 kb) tagged with the tmr 
gene (1.0 kb) in the antisense orientation. The level 
of expression from the reintroduced tagged gene is 

15 somewhat lower than the level of expression of the 

endogenous 2A11 gene. The level of expression of the 
reintroduced gene in immature green fruit is higher 
than the expression level in leaf tissue with. a small 
amount of hybridizing mRNA in leaf tissue in these 

20 transf ormants • 

Example 9 
Screening Genomic Library fbr 
Polygalacturonase Genomic Clones 
25 Isolation of a Genomic Clone 

An Sco RI partial genomic library established 
in Charon 4 constructed from DNA of a Lycopersicon 
esculentum cultivar was screened using a probe from the 
polygalacturonase cDNA (Sheehy et al . ^ Mol. Gen. Genet. 
.30 (1987) 208 ;30-36) . A lambda clone containing an ap- 
proximately IS kb insert was isolated from the library, 
of which an internal 2207 bp Hindlll to EcoRI was se- 
quenced. The Hindlll-EcoRI fragment includes the poly- 
galacturonse promoter region. 
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Sequence of Genomic Clone 

The DNA sequence of the genomic clone was 
determined by Sanger dideoxy techniques and is as shown 
in Figure 5. The sequence of the genomic clone bases 
1427 to 1748 are homologous to the polygalacturonase 
cDNA sequence. 

The above results demonstrate the ability to 
identify inducible regulatory sequences in a plant 
genome, isolate the sequences and manipulate them. In 
this way, the production of transcription cassettes and 
expression cassettes can be produced which allow for 
differentiated cell production of the desired product. 
Thus, the phenotype of a particular plant part may be 
modified, without requiring that the regulated product 
be produced in all tissues, which may result in various 
adverse effects on the growth, health, and production 
capabilities of the plant; Particularly, fruit-specific 
transcription initiation capability is provided for mo- 
difying the phenotypic properties of a variety of fruits 
to enhance properties of interest such as processing, 
organoleptin properties) storage, yield, or the like. 

E. coli strain pCGN1299x7118 was deposited 
with the American Type Culture Collection (A.T.C.C.), 
12301 Parklawn Drive, Rockville, Maryland, 20852 on 
May 21, 1987 and given Accession No. 67408. 

All publications and patent applications men- 
tioned in this specification are indicative of the 
level of skill of those skilled in the art to which 
this invention pertains. All publications and patent 
applications are herein incorporated by reference to 
the same extent as if each individual publication or 
patent application was specifically and individually 
indicated to be incorporated by reference. 



Although the foregoing invention has been de- 
scribed in some detail by way of illustration and ex- 
ample for purposes of clarity of understanding, it will 
be obvious that certain changes and modifications may 
be practiced within the scope of the appended claims. 
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WHAT IS CLAIMED IS: 
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1. A DNA construct comprising in the direction of 
transcription, a fruit-specific transcriptional initia- 
tion region from a gene expressed at or immediately 
after anthesis or at the breaker stage, said gene 
remaining expressed at least until the ripe period, 
joined to a DNA sequence of interest other than the 
wild-type sequence associated with said initiation re- 
gion, wherein said DNA sequence of interest is under 
the transcriptional regulation of said initiation 
region, and a transcriptional termination region. 

2. A DNA construct according to Claim 1, wherein 
15 said transcriptional initiation region is from a gene 

expressed iimaediately upon anthesis. 

3. A DNA construct according to Claim 1, wherein 
said transcriptional initiation region regulates tran- 
scription of a gene encoding a plant storage protein. 
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4. A DNA construct according to Claim 3, wherein 
said transcriptional initiation region is the 2A11 
region. 

5. A DNA construct according. to Claim 1, wherein 
said DNA sequence of interest is a sequence complemen- 
tary to a native plant transcript. 

6. A DNA construct according to Claim 1, wherein 
said DNA sequence of interest is an open reading frame 
encoding an amino acid sequence of interest. 

7. A DNA construct according to Claim 1, wherein 
35 said DNA sequence of interest is a polygalacturonase 

gene or fragment thereof of at least 12nt in the anti- 
sense direction. 
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8. A DNA construct for integration into a plant 
genome comprising at least the right T-DNA border 
joined to a DNA construct according to Claim 1. 

9. A DNA construct comprising in the direction of 
transcription, the fruit-specif ic . transcriptional ini- 
tiation region of a plant storage protein being active 
at or immediately after anthesis and remaining active 
until at least until the ripe period, joined to a DNA 
sequence other than the wild-type sequence, wherein 
said sequence comprises a unique restriction site for 
insertion of a sequence of interest to be under the 
transcriptional regulation of said initiation region, 
and a transcriptional termination region. 

10. A DNA construct according to Claim 9, wherein 
said transcriptional initiation region is the 2A11 
region. 

11. A DNA construct for integration into a plant 
genome comprising at least the right T-DNA border 
joined to a DNA construct according to Claim 10. 

12. A DNA vector comprising a broad spectrum 
proJcaryotic replication system and a DNA construct 
according to Claim 1. 

13. A DNA vector comprising a broad spectrum 
prokaryotic replication system and a DNA construct 
according to Claim 9. 

14. A method for specifically modifying the pheno- 
type of fruit substantially distinct from other plant 
tissue, said method comprising: 
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transforming a tomato plant cell with a DNA 
construct under genomic integration conditions, wherein 
said DNA construct comprises in the direction of tran- 
scription, a 2A11 fruit-specific transcriptional initi- 
ation region, joined to a DNA polygalacturonase gene 
sequence, wherein said sequence is oriented in the 
antisense direction and under the transcriptional regu- 
lation of said initiation region, and a transcriptional 
termination region, whereby said DNA construct becomes 
integrated into the genome of said plant cell, wiiereby 
said antisense sequence is transcribed and inhibits 
expression of polygalacturonase in fruit; 

regenerating a plant from said transformed 
plant cell; and 

growing said plant to produce fruit of the 
modified phenotype. 

15. A method according to Claim 14, wherein said 
transcription initiation region is the 2A11 region. 

16. A plant cell comprising a DUA construct 
according to Claim 1-; 

^7* A plant cell comprising a DNA construct 
according to Claim 9 

18. A method for specifically modifying the pheno- 
type of tomato fruit substantially distinct from other 
plant tissue, said method comprising: 

transforming a plant cell with a DNA construct 
under genomic integration conditions, wherein said DNA 
construct comprises in the direction of transcription, 
a fruit-specific transcriptional initiation region be- 
ing active at or immediately after anthesis, said gene 
remaining active at least until the ripe period, joined 
to a DNA sequence other than the wild-type sequence and 
capable of modifying the phenotype of fruit cells upon 
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transcription, wherein said sequence is under the tran- 
scriptional regulation of said initiation region, and a 
transcriptional termination region, whereby said DNA 
construct becomes integrated into the genome of said 
plant cell; 

regenerating a plant from said transformed 
plant cell; and 

growing said plant to produce fruit of the 
modified phenotype. 

19. A plant comprising a DNA construct according 
to Claim 1. 



20. A plant comprising a DNA construct according 
to Claim 9. 



21. Pruit comprising a construct according to 
Claim 1, 



22. Fruit according to Claim 21, wherein said 
fruit is tomato. 

23. Pruit according to Claim 22, wherein said DNA 
sequence of interest is a polygalacturonase gene or 
fragment of at least 12nt thereof oriented in the anti- 
sense direction and said transcsription initiation 
region is 2A11. 



24. Pruit according to Claim 21, wherein said 
transcription initiation region is 2A11. 
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3H11 TTTTTTTGAGCAAAGGGCAACTCAGATATCCAAAGATGAATCCAACATATA 51 



3H11 GCTTACAGCTGGGAGAACATTGTCTAACTCTTCTGAAATTTAAATGTTATC 102 



3H11 CAGAATCCTTCATCATAAAATAATATCAAAATGCAAATCTATTTTTTCTAC 153 



3H11 TCTTGTCTAGCTTCAACTTTCTTCTTCTGCTCATCAATTAGCAATTAATCC -204 

TGCTCATCAATTAGCAATTAAT£C 



3H11 AAAACCATTATGGCTGCCAAAAATTCAGAGATGAAGTTTGCTATCTTCTTC 255 
2A11 AAZVACCATTATGGCTGCCA AAAATTCAGAGATGAAGTTTGCTATCTTCTTC 

METAlaAlaLysAsnSerGluMETLysPheAlallePhePhe 



3H11 GTTGTTCTTTTGACGACCACTTTAGTTGATATGTCTGGAATTTCGAAAATG 306 
2A11 GTTGTTCTTTTGACGACCACTTTAGTTGATATGTCTGGAATTTCGAAAATG 
ValValLeuLeuThrThrThrLeuValAspMETSerGlylleSerLysMET 



3H11 CAAGTGATGGCTCTTCGAGACATACCCCCACAAGAAACATTGCTGAAAATG 357 
2A1 1 CAAGTGATGGCTCTTCGAGACATACCCCCACAAGAAACATTGCTGAAAATG 
GlnValMETAlaLeuArgAspIleProProGlnGluThrLeuLeuLysMET 



3H11 AAGCTACTTCCCACAAATATTTTGGGACTTTGTAACGAACCTTGCAGCTCA 408 
2A11 AAGCTACTTCCCACAAATATTTTGGGACTTTGTAACGAACCTTGCAGCTCA • 
LysLeuLeuProThrAsnlleLeuGlyLeuCysAsnGliiProCysSerSer 



3H11 AACTCTGATTGCATCGGAATTACCCTTTGCCAATTTTGTAAGGAGAAGACG 459 
2A11 AACTCTGATTGCATCGGAATTACCCTTTGCCAATTTTGTAAGGAGAAGACG 
AsnSerAspCysIleGlylleThrLeuCysGlnPheCysLysGliiLysThr 



3H11 GAGCAGTATGGTTTAACATACCGTACATGCAACCTGTTGCCTTGAACAATA 510 
2A11 GACCAGTATGGTTTAACATACCGTACATGCAACCTGTTGCCTTGAACAATA 
AspGlnTyr'^lyLeuThrTyrArgThrCysAsnLeuLeuPro . 



FIG. 1-1 
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3H11 TCAATGATCTATCGATCGATCTATCTATCTATTTATCTGTCTCTGCGCGTA 561 
2A1 1 TCAATGATCTATCGATCGATCTATCTATCTATTTATCTGTCTCTGCGCGTA 



3H11 TAGTGTTGTCTGTACCTTTGGTGTGAAGAATATGAATAAAGGGATACATAT 612 
2A1 1 TAGTGTTGTCTGTAGCTTTGGTGTGAAGAATATGAATAAAGGGATACATAT 



3H11 ATCTA6ATATATTCTAGGTAATGTCCTATTGTATTTAAAATTTGTAGCAAT 663 
2 Al 1 AICIAGATATATTCTAGGTAATGTCCTATTGTATTTAAAATTTGTAGCAAT 

3H11 GATTGTTTGAATAAAAACATACCATGAGTGAAATAATTATTCCACATTAAT 714 
2A11 GATTGTTTGAATAAAAACATACCATGAGTGAAATAATTATTCC 

3H11 TCACGTATTTATTTCACTTATGATACGTATTTTTGTTCCTTTCGCGTAAAA 765 
3H11 AAAAAAAAA 774 

FIG. 1-2 
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FIG. 2-1 
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2 All GENOMIC 








10 


20 


30 


40 


50 


CTCGAGCCCT 


TTAAAAAGTA 


TAGTCAATAT 


TTACGGTGAC 


CGTGAATTTC 


60 


70 


80 


90 


100 


TTAATTATGA 


TATATAATTT 


AAAAGAAATC 


ATGATCACAT 


TCTACTGATG 


110 


120 


130 


140 


150 


AGAACATGTG 


CTAATCAAGG 


GAAAACATGG 


ATGTGAAAAA 


TACTTTTTGT 


160 


170 


180 


190 


200 


TAAAAGTAAA 


AAAAAATGTG 


AAATTTTGTT 


AGTTATTTAC 


TACCTATACA 


210 


220 


230 


240 


250 


TTATTTGAGC 


ATGTGCAAAC 


TTTACAAATA 


CCTAATAGAA 


GATTTTCACC 


260 


270 


280 


290 


300 


TGCCTGTATA 


TATGTAAATT 


AATTATAATG 


AACACTCTCA 


CATAAAATAA 


310 


320 


330 


340 


350 


TTATCAGTAT 


ATACATTAAT 


ACTTGCCCTC 


CACAATGAAT 


TAAATAAAAT 


360 


370 


380 


390 


400 


GTAGAACATG 


ATCTACACTT 


CAATAAAACT 


AAGACCATAA 


AGAATAATTT 


410 


420 


430 


440 


450 


CAAAATATAC 


ACATGTCAAC 


AATAAATTAT 


TTGCATATTA 


TATTAACTTA 


460 


470 


480 


490 


500 


CTAAACAATC 


TTTACTTTTG 


AAATATAAAA 


ATAATCAAGT 


TATAAGTCTG 


510 


520 


530 


540 


550 


CTCAAAGTAA 


AGCACTTGTT 


AGACTCATCT 


GATTTTGAGA AGGTAAGCAA 


560 


570 


580 


590 


600 


ATTGATGGTG 


CATAATAGTC 


ACAAGTAAAA 


TATAAAATAG 


ATTTCATTAG 


610 


620 


630 


640 


650 


TAAAATTGTT 


TTTTACTTTC 


TTTATATATA 


ATTATCAATA 


TCCTTCAATG 


660 


670 


680 


690 


700 


GTAGGTTAAT 


TATATTGTTA 


ACTTCTTGTT 


GAATTAAAGC 


AATAAGACAA 


710 


720 


730 


740 


750 


GAATATTAAA 


GATAAAAGAA 


CAATAAAAAT 


AGAAAGACTA AGAGATAAGA 


760 


770 


780 


790 


800 


GTTTTCTTAT 


TCTTCTTTCA 


ATAAGTATCA 


TCAAGTGTAT 


ACAATATAAA 


810 


820 


830 


840 


850 


TTTTTGTATT 


TTTGATCTAT 


CTATTTATAA 


TGTTATATAT AAGCATACAA 


860 


870 


880 


890 


900 


AAGATCAGTC 


ATAAATATGA 


CTTTAATCAT 


GAAAATAATG 


AAAGAGATTA 


910 


920 


930 


940 


950 


TGAAGGCGTA 


AGGTTACTAG 


AATAATAGTC 


ATTAAAAAAA 


GGGGTTATCT 


960 


970 


980 


990 


1000 


TTATAATTGA 


ATAATTGATG 


AAGTAATGGA 


GATAATTAGT 


GAGCATAAAT 


1010 


1020 


1030 


1040 


1050 


TTTTTTAAAA 


AAATGGACAT 


TTACACTATA ATATTTTATA ACACTTTCCC 


1060 


1070 


1080 


1090 


1100 


TTAAACATCT 


AGGTATAAAT 


AATGAGTCTT 


GTCAAAATCT 


TAGTAGGAAA 



FIG. 4-1 



6/12 



1110 1120 1130 1140 1150 

AATTCTGTGA AATTTTTTTA GTGAAAACAA ATGATATAAA TATCTTGAAT 
1160 . 1170 1180 1190 1200 

ACTCATTATT TGTTCTCTCA TTAAAAATCT TATCTGACCT ATAAAATAAA 
1210 1220 1230 1240 1250 

TTATTTGCTC AACTCAAAAT AGTTTTTCAT TCTAAAATTA GTATAATTAT 
1260 1270 1280 1290 1300 

TAGTGAATAT TTAATTAACA TAATTGTATA CTAAGGGGCC TATAAATTGG 
1310 1320 1330 1340 1350 

ATTCTTCTCA AAGAAAAATA AAATCACCAC ACAACTTTCT TCTTCTGCTC 

1360 1370 1381 1390 

ATCAATTAGC AATTAATCCA AAACCATT ATG GCT GCC AAA AAT 

MET Ala Ala Lys Asn 
1399 1408 1417 1426 

TCA GAG ATG AAG TTT GCT ATC TTC TTC GTT GTT CTT TTG 
Ser Glu MET Lys Phe Ala He Phe Phe Val Val Leu Leu 
1435 1444 1454 1464 1474 

ACG ACC ACT TTA GGTTCACAAC ACTTCTCCCT TATTTTGTTT 
Thr Thr Thr Leu 



1484 


1494 


1504 


1514 


1524 


TCTTAATTTC 


TTGGAAGTCA 


TATGCATGTG 


TTTGGTATCA 


TGGTATATAT 


1534 


1544 


1554 


1564 


1574 


ATAAAGGAAA 


ATATTTTTCT 


TAATTACTGG 


TTTTCTAATG 


TTTGGTAGGT 


1584 


1594 


1604 


1614 


1624 


AATCGGAAAT 


TATTATGAGA 


TAATGAACTT 


GCAAAGTCAT 


TATTATATAA 


1634 


1644 


1654 


1664 


1674 


CTTTTTTTTT 


ATACTTTGAT 


TTAAGAATTC 


ATTTTTCTCA 


TTTTATATAA 


1684 


1694 


1704 


1714 


1724 


ACTTATTTTT 


CAACAGAAAA 


TATTTTTCGA 


ACTATTCAAA 


CACACCCTAA 


1734 


- 1744 


1754 


1764 


1774 


GACATTACAT 


ATATATATAT 


ATACACCCTC 


CGTTTTATAT 


TACTTAATGC 


1784 


1794 


1804 


1814 


1824 


CTATTGAGTT 


GGCCCACCCT 


TTAAGAATGA 


TTCAATTAGA 


GATATGTTTT 


1834 


1844 


1854 


1864 


1874 


ACTAAATTAA 


CCTATGCTTT 


AAGACTCTAA 


ATTTGGCTAT 


TACTATTTTA 


1884 


1894 


1904 


1914 


1924 


CGTTGTAATT 


TAATGACAAA 


CATTTCATAA 


TGACTATAGT 


CTGAACTTAA 


1934 


1944 


1954 


1964 


1974 


TTAGACAGAC 


GTATCTATAG 


TTTGCTTACT 


AATGATTCAT 


AGCTATATAT 


1984 


1994 


2004 


2014 


2024 


TTGGAGAGGA 


GAGAGACAAA 


CGATATTAAG 


AAAGGGAGGA 


GAGAGGCGAG 


2034 


2044 


2054 


2064 


2074 


GTAAATCTGA 


AATAGAGAAG 


AGAAAGGCAA 


CCAATTTTGA 


TCATCTATCA 


2084 


2094 


2104 


2114 


2124 


TACTTTTGAT 


TATTATTTTT 


ATTATATGTA 


CGTTTACATT 


ACAGTTTTCG 



FIG. 4-2 
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2134 2144 2154 2164 

AATTCTTACA TTAATCTTAA TCATAATATA TACA GTT GAT ATG 

Val Asp MET 
2173 2182 2191 2200 

TCT GGA ATT TCG AAA ATG CAA GTG ATG GCT CTT CGA GAC 
Ser Gly He Ser Lys MET Gin Val MET Ala Leu Arg Asp 
2209 2218 2227 2236 2245 

ATA CCC CCA CAA GAA ACA TTG CTG AAA ATG AAG CTA CTT 
lie Pro Pro Gin Glu Thr Leu Leu Lys MET Lys Leu Leu 

2254 2263 2272 2281 

CCC ACA AAT ATT TTG GGA CTT TGT AAC GAA CCT TGC AGC 
Pro Thr Asn He Leu Gly Leu Cys Asn Glu Pro Cys Ser 

2290 2299 2308 2317 

TCA AAC TCT GAT TGC ATC GGA ATT ACC CTT TGC CAA TTT 
Ser Asn Ser Asp Cys He Gly He Thr Leu Cys Gin Phe 
2326 2335 2344 2353 2362 

TGT AAG GAG AAG ACG GAC CAG TAT GGT TTA ACA TAC CGT 
Cys Lys Glu Lys Thr Asp Gin Tyr Gly Leu Thr Tyr Arg 

2371 2380 2393 2403 

ACA TGC AAC CTG TTG CCT TGA ACAATATCAA TGATCTATCG 



Thr Cys Asn Leu Leu Pro . 



2413 


2423 


2433 


2443 


2453 


ATCGATCTAT 


CTATCTATTT 


ATCTGTCTCT 


GCGCGTATAG 


TGTTGTCTGT 


2463 


,y 2473 


2483 


2493 


2503 


ACCTTTGGTG 


TGAAGAATAT 


GAATAAAGGG 


ATACATATAT 


CTAGATATAT 


2513 


2523 


2533 


2543 


2553 


TCTAGGTAAT 


GTCCTATTGT 


ATTTAAAATT 


TGTAGCAATG 


ATTGTTTGAA 


2563 


2573 


2583 


2595 


2603 


TAAAAACATA 


CCATGAGTGA 


AATAATTATT 


CCACATTAAT 


TCACGTATTT 


2613 


2623 


2633 


2643 


2653 


ATTTCACTTA 


TGATACGTAT 


TTTTGTTCCT 


TTCGCGTAGA 


TTTTTGATCC 


2663 


2673 


2683 


2693 


2703 


TTTTCCCTTT 


TGAATATTAA 


ACATTAAACA 


CAAATAATGT 


TTATTAAATT 


2713 


2723 


2733 


2743 


2753 


AAGTTAATAT 


TTTTATTTAG 


CTATTTATAT 


TTTTATTTGA 


AATCAAACTT 


2763 


2773 


2783 


2793 


2803 


GATAAATATT 


TATAAAGATA 


ATTAACAAGT 


AATGTGACAC 


TAACACCATG 


2813 


2823 


2833 


2843 


2853 


TAATATTATC 


TTGTCGTTAT 


TTATGATAAT 


ATTTTAAAAT 


TATAATTTCA 


2863 


2873 


2883 


2893 


2903 


GTTAAAAAAT 


TATTAAAAAA 


ACATACTTTT 


AAAAAGTGAG 


TTAGCCTCCG 


2913 


2923 


2933 


2943 


2953 


CTACCCACAT 


ACTTATGAAT 


TGGACTAGTT 


GTTTTTTGAC 


CCACAAAAAG 


. 2963 


2973 


2983 


2993 


3003 


AATGGGCTAA 


TTAAACCTGA 


CCTATCAAAT 


TTCAGAATCT 


GCATAGATTA 



FIG. 4-3 
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3013 


3023 


3033 


3043 


3053 


GTCCGAACGA 


AATGAGTCAG 


CCCGTATTGA 


ACAAAATATC 


AACAAGGACG 


3063 


3073 


3083 


3093 


3103 


TTATGTAAAG 


ATGTTTAAGA 


AGGAAAAAAG 


ATTTCTAATA 


CATATGGACT 


3113 


3123 


3133 


3143 


3153 


TTCAATATCC 


CAACTTTGTC 


TGGCGATCTG 


AACCCTGCTT 


AGTTTGTTGA 


3163 


3173 


3183 


3193 


3203 


TCATTAACTT 


GTCTTGCTAT 


GTATTTAAGA 


TTTAAACTTT 


ATATGTTTAA 


3213 


3223 


3233 


3243 


3253 


ACTTACAGAA 


AATACATATA 


AATCTCTCAA 


GACTTGGCAA 


CATAATTTAC 


3263 


3273 


3283 


3293 


3303 


TTTAGTACTT 


AAACTACATG 


AAAATTTAAA 


TATCCTTTTA 


ACATCTTTGA 


3313 


3323 


3333 


3343 


3353 


AGTGAATTAA 


ATTATCACAA 


TCCGAGCCTA 


CACCTTGGAC 


GTGGCCGGCA 


3363 


3373 


3383 


3393 


3403 


CTCAAGAACC 


AGTGCTGGTC 


CCCAAGCTAA 


CCCTCATCCT 


GACTGACTAC 


3413 


3423 


3433 


3443 


3453 


AAGCGGAAGG 


CTAACTTAAG 


TATACAAAAG 


CTTAAAACTG 


AATAAAATAA 


3463 


3473 


3483 


3493 


3503 


ACTTTACAAG 


GTTTTAACAC 


AAATGAACAA 


CTTTGAAGAA 


AATAATATAT 


3513 


3523 


3533 


3543 


3553 


TCAACTAGCC 


ATAAAATAGA 


CAACTTTAGT 


CTTTAAAACA 


TTTAATAAAA 


3563 


3573 


3583 


3593 


3603 


TAAATGCAAA 


ATATAGACTC 


CTTAACTAAA 


CTGACTATCT 


ATGGAGCCTC 


3613 


■ 3623 


3633 


3643 


3653 


TAATTGATAA 


AGATGGAAGT 


CGGGACAAGA 


CCACGACATC 


CTGACTAAAC 


3663 


3673 


3683 


3 S33 


3703 


TGAGAAGTAA 


ATAAAATCCC 


CCGGAAAAAA 


AGGAGCCTCA 


CCATGGCTAA 


3713 


3723 


3733 


3743 


3753 


CTCGAACTCG 


GGGATATATC 


AATGAAGCTC 


CTGTTGATGA 


TCTTGAAGAC 


3763 


3773 


3783 


3793 


3803 


ATGTCTCTGC 


ATCATCAAAA 


AGATGCAGGC 


CAAATGGCTC 


AGTACGTAAA 


3813 


3823 


3833 


3843 


3853 


ATGTACGAGT 


ATGTAAGGGA 


AATTCTAAAG 


TATAACATAA 


GCTTGATACT' 


3863 


3873 


3883 


3893 


3903 


TGAATAAAAG 


GAAACATACT 


TACCTCTTTT 


CAACTGAACT 


CAAATTAAGA 


3913 


3923 


3933 


3943 


3953 


ATAAGATACT 


CAACTCAAAG 


ATTAGGTATT 


CAACGCAAAT 


ATGGCACTCT 


3963 


3973 


3983 


3993 


4003 


ACTCAATGAA 


GTACAAATTA 


ACTCAGGATA 


CTCGACTTAA 


GATACTCAAC 


4013 


4023 


4033 


4043 


4053 


TCCCGACACT 


CAACTGAACT 


CATTTCAATA 


TAAAGCAGCT 


TAAAACAAGT 


4063 


4073 


4083 


4093 


4103 


TCAGTATAAA 


GTAAAGTTGT 


TTAAAAACAT 


GATGTCAACT 


CTGTGTGTAT 


4113 


4123 


4133 


4143 


4153 


AATAAGGGAT 


ACAACATAAC 


TTTGAAATGT. 


ATATAAAAAT 


ACAATTAACT 



FIG. 4-4 
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4173 


4183 






GATGTATATA 


AAAATACATT 


AATCTATGGG 


AGATTCTCTA 


ACCGACAACC 


4213 


4223 


4233 


4243 


4253 


ATCACTTAAG 


GGCTAAGATG 


ATGATATAGC 


GATCTACCGC 


ACGCTGCCAT 


4263 


4273 


4283 


4293 


4303 


CGCATCTTAT 


ACCCGGCCAA AGGTATAAGA 


CCTGAACTGC 


CTAATGAATC 


4313 


4323 


4333 


4343 


4353 


CACTAATAAA 


CTGTTAAAAG 


GAATCATCTA 


AAAAGTATGA 


CCCTTTTCTA 


4363 


4373 


4383 


4393 


4403 


CCCATAGTGG 


CTAACATGGT 


TTATGGGGGC 


TGTGAGTTAT 


CTGAACTCTC 


4413 


4423 


4433 


4443 


4453 


CCCCATATCG 


GTGCTCAATA 


CTACTCCJ^ 


AAATATACTG 


CTCTTATGTT 


4463 


4473 


4483 


4493 


4503 


TAAAAACATA 


CTGATTCTGT 


GGTTTGAAAT 


TATTGCTTAA 


AGCTTAGATT 


4513 


4523 


4533 


4543 


4553 


TTTGAAAAGC 


TCTCTTTTGA 


AAATCGTAGT 


TTCCTTTTTC 


TTCTATTAAA 


4563 


4573 


4583 


4593 


■■ 4603 


GCTAGACATA 


GGCTATGTAG 


AACTCTAGCT 


TACCTTCCTT 


CTCAAAAGTT 


4613 


4623 


4633 


4643 


4653 


TGAAAACATT 


TGCTTAGATT 


CTTAGGGACT 


ACTTAGTTCC 


CTTGTTGGAA 



TTC 

FIG. 4-5 
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PG GENOMIC 



10 


. 20 


30 


40 


50 


AAGCTTCTTA AAAAGGCAAA 


TTGATTAATT 


TGAAGTCAAA 


ATAATTAATT 


60 


70 


80 


90 


100 


ATAACAGTGG 


TAAAGCACCT 


TAAGAAACCA 


TAGTTTGAAA 


GGTTACCAAT 


110 


120 


130 


140 


150 


GCGCTATATA 


TTAATCAACT 


TGATAATATA 


AAAAAAATTT 


CAATTCGAAA 


160 


170 


180 


190 


200 


AGGGCCTAAA 


ATATTCTCAA 


AGTATTCGAA 
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